Smart Paper Notes

Clustering and Analysis of GPS Trajectory Data using Distance-based Features

Zann Koh, Yuren Zhou, Billy Pik Lik Lau, Ran Liu, Keng Hua Chong, Chau Yuen

Category: Machine Learning

2022-12-01

The proliferation of smartphones has accelerated mobility studies by greatly increasing the types and volume of mobility data available. One such source is GPS technology, which is becoming increasingly common and helps the research community understand people's mobility patterns. However, there is no standardized framework for using machine learning methods to study the different mobility patterns created by the non-Work, non-Home locations of Working and Nonworking users on Workdays and Offdays. We propose a new mobility metric, Daily Characteristic Distance, and use it together with Origin-Destination matrix features to generate features for each user. We then apply an unsupervised machine learning method, $k$-means clustering, to those features and obtain three clusters of users for each type of day (Workday and Offday). Finally, we propose two new metrics for analyzing the clustering results, namely User Commonality and Average Frequency. Using the proposed metrics, interesting user behaviors can be discerned, which helps us better understand the users' mobility patterns.
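
The clustering step described in this abstract can be illustrated with a small, self-contained sketch of $k$-means with three clusters. This is not the authors' code: the two feature columns, the sample values, and the deterministic farthest-point initialisation are all assumptions chosen to keep the example reproducible.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two feature tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def init_centers(points, k):
    """Deterministic farthest-point initialisation (an assumption, not from the paper)."""
    centers = [points[0]]
    while len(centers) < k:
        # Pick the point whose nearest existing center is farthest away.
        centers.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centers)))
    return centers


def kmeans(points, k, max_iters=100):
    """Lloyd's algorithm: alternate assignment and mean-update steps."""
    centers = init_centers(points, k)
    labels = []
    for _ in range(max_iters):
        # Assignment step: each point goes to its nearest center.
        labels = [min(range(k), key=lambda c: sq_dist(p, centers[c])) for p in points]
        # Update step: each center moves to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            # Keep the old center if a cluster ends up empty.
            new_centers.append(
                tuple(sum(col) / len(members) for col in zip(*members)) if members else centers[c]
            )
        if new_centers == centers:
            break
        centers = new_centers
    return labels, centers


# Hypothetical per-user features: (distance-based feature in km, share of trips to other places).
users = [(1.0, 0.10), (1.2, 0.15), (0.9, 0.12),
         (8.0, 0.50), (8.5, 0.55), (7.8, 0.45),
         (20.0, 0.90), (21.0, 0.85), (19.5, 0.95)]
labels, centers = kmeans(users, k=3)
print(labels)
```

In practice the feature columns would usually be standardized first so that no single column dominates the distance.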

translated by Google Translate

Towards Interpreting Vulnerability of Multi-Instance Learning via Customized and Universal Adversarial Perturbations

Yu-Xuan Zhang, Hua Meng, Xuemei Cao, Zhengchun Zhou, Mei Yang

Category: Computer Vision | Machine Learning

2022-11-30

Multi-instance learning (MIL) is a powerful paradigm for dealing with complex data and has achieved impressive results in a number of fields, including image classification, video anomaly detection, and many more. Each data sample is referred to as a bag containing several unlabeled instances, and supervised information is provided only at the bag level. The safety of MIL learners is a concern, though, because they can be badly fooled by introducing a few adversarial perturbations. This can be fatal in some cases, such as when users are unable to access desired images or criminals attempt to trick surveillance cameras. In this paper, we design two adversarial perturbations to interpret the vulnerability of MIL methods. The first method efficiently generates a bag-specific perturbation (called customized) that aims to push the bag out of its original classification region. The second method builds on the first by investigating an image-agnostic perturbation (called universal) that aims to affect all bags in a given data set and offers some generalizability. We conduct various experiments to verify the performance of these two perturbations, and the results show that both can effectively fool MIL learners. We additionally propose a simple strategy to lessen the effects of adversarial perturbations. Source code is available at https://github.com/InkiInki/MI-UAP.

GreenPLM: Cross-lingual pre-trained language models conversion with (almost) no cost

Qingcheng Zeng, Lucas Garay, Peilin Zhou, Dading Chong, Yining Hua, Jiageng Wu, Yikang Pan, Han Zhou, Jie Yang

Category: Natural Language Processing

2022-11-13

While large pre-trained models have transformed the field of natural language processing (NLP), the high training cost and low cross-lingual availability of such models prevent the new advances from being shared equally by users across all languages, especially the less widely spoken ones. To promote equal opportunities for all language speakers in NLP research and to reduce energy consumption for sustainability, this study proposes an effective and energy-efficient framework, GreenPLM, that uses bilingual lexicons to directly translate language models of one language into other languages at (almost) no additional cost. We validate this approach in 18 languages and show that this framework is comparable to, if not better than, other heuristics trained at high cost. In addition, when given a low computational cost (2.5%), the framework outperforms the original monolingual language models in six out of seven tested languages. We release language models in 50 languages translated from English and the source code here.

Towards Few-Shot Open-Set Object Detection

Binyi Su, Hua Zhang, Jingzhi Li, Zhong Zhou

Category: Computer Vision

2022-10-28

Open-set object detection (OSOD) aims to detect known categories and identify unknown objects in a dynamic world, and it has attracted significant attention. However, previous approaches only consider this problem in data-abundant conditions and neglect few-shot scenes. In this paper, we seek a solution for few-shot open-set object detection (FSOSOD), which aims to quickly train a detector from few samples while detecting all known classes and identifying unknown classes. The main challenge for this task is that having few training samples induces the model to overfit on the known classes, resulting in poor open-set performance. To tackle this issue, we propose a new FSOSOD algorithm named Few-shOt Open-set Detector (FOOD), which contains a novel class weight sparsification classifier (CWSC) and a novel unknown decoupling learner (UDL). To prevent overfitting, CWSC randomly sparsifies parts of the normalized weights used for the logit prediction of all classes, which decreases the co-adaptability between a class and its neighbors. Alongside this, UDL decouples the training of the unknown class and enables the model to form a compact unknown decision boundary. Thus, unknown objects can be identified with a confidence probability without any pseudo-unknown samples for training. We compare our method with several state-of-the-art OSOD methods in few-shot scenes and observe that it improves the recall of unknown classes by 5%-9% across all shots in the VOC-COCO dataset setting.
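
One way to picture the weight sparsification idea in this abstract is the sketch below: features and class weights are L2-normalized, and during training a random subset of each weight vector's entries is zeroed before the logits are computed. The abstract does not give the masking scheme, so the per-entry Bernoulli mask, the `drop_prob` parameter, and the function names are assumptions for illustration and should not be read as the FOOD implementation.

```python
import math
import random


def l2_normalize(v):
    """Scale a vector to unit L2 norm (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def sparsified_logits(feature, class_weights, drop_prob=0.5, rng=None, training=True):
    """Cosine-style logits with randomly zeroed weight entries during training.

    drop_prob and the per-entry mask are hypothetical choices, not taken from the paper.
    """
    rng = rng or random.Random()
    f = l2_normalize(feature)
    logits = []
    for w in class_weights:
        w = l2_normalize(w)
        if training:
            # Randomly sparsify: each normalized weight entry is dropped with drop_prob.
            w = [0.0 if rng.random() < drop_prob else x for x in w]
        logits.append(sum(a * b for a, b in zip(f, w)))
    return logits


# Evaluation mode uses the full normalized weights (plain cosine similarity).
print(sparsified_logits([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0]], training=False))
```

Passing a seeded `random.Random` as `rng` makes the training-time mask reproducible.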

METS-CoV: A Dataset of Medical Entity and Targeted Sentiment on COVID-19 Related Tweets

Peilin Zhou, Zeqiang Wang, Dading Chong, Zhijiang Guo, Yining Hua, Zichang Su, Zhiyang Teng, Jiageng Wu, Jie Yang

Category: Natural Language Processing

2022-09-28

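The COVID-19 pandemic continues to raise various topics that are discussed or debated on social media. To explore the impact of the pandemic on people's lives, it is crucial to understand the public's concerns about and attitudes toward pandemic-related entities (e.g., drugs, vaccines) on social media. However, models trained on existing named entity recognition (NER) or targeted sentiment analysis (TSA) datasets have limited ability to understand COVID-related social media text, because these datasets were not designed or annotated from a medical perspective. This paper releases METS-CoV, a dataset containing medical entities and targeted sentiments from COVID-related tweets. METS-CoV contains 10,000 tweets with 7 types of entities, including 4 medical entity types (Disease, Drug, Symptom, and Vaccine) and 3 general entity types (Person, Location, and Organization). To further investigate tweet users' attitudes toward specific entities, 4 types of entities (Person, Organization, Drug, and Vaccine) are selected and annotated with user sentiments, resulting in a targeted sentiment dataset with 9,101 entities (in 5,278 tweets). To the best of our knowledge, METS-CoV is the first dataset to collect medical entities and corresponding sentiments from COVID-related tweets. We benchmark classical machine learning models and state-of-the-art deep learning models through extensive experiments. The results show that the dataset leaves substantial room for improvement on both the NER and TSA tasks. METS-CoV is an important resource for developing better medical social media tools and facilitating computational social science research, especially in epidemiology. Our data, annotation guidelines, benchmark models, and source code are publicly available (https://github.com/ylab-open/mets-cov) to ensure reproducibility.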

PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition

Qidong Huang, Xiaoyi Dong, Dongdong Chen, Hang Zhou, Weiming Zhang, Kui Zhang, Gang Hua, Nenghai Yu

Category: Computer Vision

2022-09-16

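Despite the prominent performance achieved in various applications, point cloud recognition models often suffer from natural corruptions and adversarial perturbations. In this paper, we delve into boosting the general robustness of point cloud recognition models and propose Point-Cloud Contrastive Adversarial Training (PointCAT). The main intuition of PointCAT is to encourage the target recognition model to narrow the decision gap between clean point clouds and corrupted point clouds. Specifically, we leverage a supervised contrastive loss to promote the alignment and uniformity of the hypersphere features extracted by the recognition model, and we design a pair of centralizing losses with dynamic prototype guidance to keep these features from deviating from the category clusters they belong to. To provide more challenging corrupted point clouds, we adversarially train a noise generator together with the recognition model from scratch, instead of using a gradient-based attack as the inner loop as in previous adversarial training methods. Comprehensive experiments show that the proposed PointCAT outperforms the baseline methods and significantly improves the robustness of different point cloud recognition models under a variety of corruptions, including isotropic point noise, LiDAR-simulated noise, random point dropping, and adversarial perturbations.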

MIntRec: A New Dataset for Multimodal Intent Recognition

Hanlei Zhang, Hua Xu, Xin Wang, Qianrui Zhou, Shaojie Zhao, Jiayan Teng

Category: Artificial Intelligence

2022-09-09

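Multimodal intent recognition is an important task for understanding human language in the real world. Most existing intent recognition methods are limited in how they can leverage multimodal information, because the benchmark datasets contain only text. This paper introduces a novel dataset for multimodal intent recognition (MIntRec) to address this issue. It formulates coarse-grained and fine-grained intent taxonomies based on data collected from the TV series Superstore. The dataset consists of 2,224 high-quality samples with text, video, and audio modalities and has multimodal annotations across twenty intent categories. Furthermore, we provide annotated bounding boxes of the speakers in each video segment and implement an automatic process for speaker annotation. MIntRec helps researchers mine the relationships between different modalities to enhance intent recognition. We build baselines by adapting three powerful multimodal fusion methods, extracting features from each modality and modeling cross-modal interactions. Extensive experiments show that employing the non-verbal modalities achieves substantial improvements over the text-only modality, demonstrating the effectiveness of using multimodal information for intent recognition. The gap between the best-performing methods and humans indicates the challenge and importance of this task for the community. The full dataset and code are available at https://github.com/thuiar/mintrec.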

Multi-agent Deep Reinforcement Learning for Charge-sustaining Control of Multi-mode Hybrid Vehicles

Min Hua, Quan Zhou, Cetengfei Zhang, Hongming Xu, Wei Liu

Category: Machine Learning

2022-09-06

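Transportation electrification requires an increasing number of electric components (e.g., electric motors and electric energy storage systems) on vehicles, and control of electric powertrains usually involves multiple inputs and multiple outputs (MIMO). This paper focuses on the online optimization of the energy management strategy for a multi-mode hybrid electric vehicle based on multi-agent reinforcement learning (MARL) algorithms, which aim to address MIMO control optimization, whereas most existing methods only deal with single-output control. Based on an analysis of the energy efficiency of a multi-mode hybrid electric vehicle (HEV) optimized by a deep deterministic policy gradient (DDPG)-based MARL algorithm, a new collaborative cyber-physical learning scheme with multiple agents is proposed. A learning driving cycle is then set by a novel random method to speed up the training process. Finally, network design, learning rate, and policy noise are included in a sensitivity analysis to determine the parameters of the DDPG-based algorithm. The learning performance under different relationships between the agents is studied, showing that a not completely independent relationship with a ratio of 0.2 is the best. A comparison study between the single-agent and multi-agent schemes shows that the multi-agent scheme achieves a 4% improvement in total energy over the single-agent scheme. Therefore, multi-objective control by MARL can achieve good optimization results and application efficiency.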

Uformer-ICS: A Specialized U-Shaped Transformer for Image Compressive Sensing

Kuiyuan Zhang, Zhongyun Hua, Yuanman Li, Yushu Zhang, Yicong Zhou

Category: Computer Vision

2022-09-05

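Recently, several studies have applied deep convolutional neural networks (CNNs) to image compressive sensing (CS) tasks to improve reconstruction quality. However, convolutional layers generally have a small receptive field, so capturing long-range pixel correlations with CNNs is challenging, which limits their reconstruction performance in image CS tasks. Considering this limitation, we propose a U-shaped transformer for image CS tasks, called Uformer-ICS. We develop a projection-based transformer block by integrating the prior projection knowledge of CS into the original transformer block, and then build a symmetrical reconstruction model from the projection-based transformer blocks and residual convolutional blocks. Compared with previous CNN-based CS methods, which can only exploit local image features, the proposed reconstruction model can simultaneously exploit the local features and long-range dependencies of an image as well as the prior projection knowledge of CS theory. In addition, we design an adaptive sampling model that samples image blocks adaptively based on block sparsity, which ensures that the compressed results retain the maximum possible information of the original image under a fixed sampling ratio. The proposed Uformer-ICS is an end-to-end framework that learns the sampling and reconstruction processes simultaneously. Experimental results show that it achieves significantly better reconstruction performance than existing deep learning-based CS methods.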
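
The adaptive sampling idea in this abstract (distributing a fixed measurement budget across image blocks) can be sketched as a proportional allocation problem. The abstract does not specify how block sparsity maps to measurement counts, so `block_scores` (a hypothetical per-block score where a higher value means a less sparse, more detailed block), the proportional rule, and the largest-remainder rounding below are all assumptions rather than the Uformer-ICS method.

```python
def allocate_measurements(block_scores, block_size, sampling_ratio):
    """Split a fixed measurement budget across blocks in proportion to their scores.

    block_scores: hypothetical non-negative score per block (higher = less sparse).
    block_size: number of pixels per block.
    sampling_ratio: overall ratio of measurements to pixels, kept fixed.
    Note: this sketch does not cap a block's count at block_size.
    """
    n = len(block_scores)
    total = round(sampling_ratio * block_size * n)  # overall budget
    score_sum = sum(block_scores)
    if score_sum == 0:
        raw = [total / n] * n  # fall back to a uniform split
    else:
        raw = [total * s / score_sum for s in block_scores]
    counts = [int(r) for r in raw]  # floor of each share
    # Largest-remainder rounding so the counts add up to the budget exactly.
    leftover = total - sum(counts)
    order = sorted(range(n), key=lambda i: raw[i] - counts[i], reverse=True)
    for i in order[:leftover]:
        counts[i] += 1
    return counts


# Four 4x4 blocks (16 pixels each) at an overall sampling ratio of 0.25.
counts = allocate_measurements([1, 1, 2, 4], block_size=16, sampling_ratio=0.25)
print(counts, sum(counts))
```

The total number of measurements stays at the fixed budget while blocks with higher scores receive more of it.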

Large-scale Entity Alignment via Knowledge Graph Merging, Partitioning and Embedding

Kexuan Xin, Zequn Sun, Wen Hua, Wei Hu, Jianfeng Qu, Xiaofang Zhou

Category: Machine Learning | Artificial Intelligence

2022-08-23

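Entity alignment is a crucial task in knowledge graph fusion. However, most entity alignment approaches have scalability problems. Recent methods address this issue by dividing large knowledge graphs (KGs) into small blocks for embedding and alignment learning. However, this partitioning and learning process causes an excessive loss of structure and alignment. Therefore, in this work, we propose a scalable GNN-based entity alignment approach that reduces the structure and alignment loss from three perspectives. First, we propose a centrality-based subgraph generation algorithm to recall some landmark entities that serve as bridges between different subgraphs. Second, we introduce self-supervised entity reconstruction to recover entity representations from incomplete neighborhood subgraphs, and we design cross-subgraph negative sampling to incorporate entities from other subgraphs into alignment learning. Third, during inference, we merge the embeddings of the subgraphs into a single space for alignment search. Experimental results on the benchmark OpenEA dataset and the proposed large DBpedia1M dataset verify the effectiveness of our approach.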
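
The first step in this abstract, recalling landmark entities that bridge subgraphs, can be pictured with the sketch below. The abstract does not say which centrality measure is used, so degree centrality, the tie-breaking by entity name, and the function names are assumptions made for illustration only.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Count how many edges touch each entity (an assumed stand-in centrality)."""
    degree = defaultdict(int)
    for head, tail in edges:
        degree[head] += 1
        degree[tail] += 1
    return dict(degree)


def add_landmarks(partitions, edges, num_landmarks):
    """Add the most central entities to every partition so they can act as bridges."""
    degree = degree_centrality(edges)
    # Highest degree first; ties are broken by entity name for determinism.
    landmarks = sorted(degree, key=lambda e: (-degree[e], e))[:num_landmarks]
    return [set(part) | set(landmarks) for part in partitions], landmarks


# Toy knowledge graph split into two partitions.
edges = [("a", "b"), ("a", "c"), ("a", "d"), ("d", "e"), ("e", "f")]
partitions = [{"a", "b", "c"}, {"d", "e", "f"}]
subgraphs, landmarks = add_landmarks(partitions, edges, num_landmarks=2)
print(landmarks, subgraphs)
```

Here the landmark entities appear in both subgraphs, which gives the otherwise disjoint partitions shared entities to connect through.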
